Overview
Brought to you by YData
Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 750000 |
| Missing cells | 233124 |
| Missing cells (%) | 1.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 64.4 MiB |
| Average record size in memory | 90.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 2 |
| Boolean | 10 |
Episode_Length_minutes is highly overall correlated with Listening_Time_minutes | High correlation |
Genre_Business is highly overall correlated with Podcast_Name | High correlation |
Genre_Music is highly overall correlated with Podcast_Name | High correlation |
Genre_True Crime is highly overall correlated with Podcast_Name | High correlation |
Listening_Time_minutes is highly overall correlated with Episode_Length_minutes | High correlation |
Podcast_Name is highly overall correlated with Genre_Business and 2 other fields | High correlation |
Genre_Business is highly imbalanced (50.8%) | Imbalance |
Genre_Comedy is highly imbalanced (50.4%) | Imbalance |
Genre_Education is highly imbalanced (65.1%) | Imbalance |
Genre_Health is highly imbalanced (54.6%) | Imbalance |
Genre_Lifestyle is highly imbalanced (50.0%) | Imbalance |
Genre_Music is highly imbalanced (58.5%) | Imbalance |
Genre_News is highly imbalanced (58.2%) | Imbalance |
Episode_Length_minutes has 87093 (11.6%) missing values | Missing |
Guest_Popularity_percentage has 146030 (19.5%) missing values | Missing |
Podcast_Name has 17327 (2.3%) zeros | Zeros |
Publication_Day has 108237 (14.4%) zeros | Zeros |
Number_of_Ads has 217592 (29.0%) zeros | Zeros |
Listening_Time_minutes has 8551 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-24 11:49:18.432688 |
|---|---|
| Analysis finished | 2025-04-24 11:49:58.649863 |
| Duration | 40.22 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Podcast_Name
Real number (ℝ)
High correlation  Zeros 
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.515731 |
| Minimum | 0 |
|---|---|
| Maximum | 47 |
| Zeros | 17327 |
| Zeros (%) | 2.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 23 |
| Q3 | 37 |
| 95-th percentile | 45 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.137577 |
|---|---|
| Coefficient of variation (CV) | 0.60119657 |
| Kurtosis | -1.2686177 |
| Mean | 23.515731 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.015133322 |
| Sum | 17636798 |
| Variance | 199.87108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42 | 22847 | 3.0% |
| 39 | 20053 | 2.7% |
| 15 | 19635 | 2.6% |
| 43 | 19549 | 2.6% |
| 14 | 19488 | 2.6% |
| 3 | 19480 | 2.6% |
| 41 | 19364 | 2.6% |
| 17 | 19272 | 2.6% |
| 30 | 18889 | 2.5% |
| 6 | 17735 | 2.4% |
| Other values (38) | 553688 |
| Value | Count | Frequency (%) |
| 0 | 17327 | |
| 1 | 11543 | |
| 2 | 17012 | |
| 3 | 19480 | |
| 4 | 15927 | |
| 5 | 17374 | |
| 6 | 17735 | |
| 7 | 13138 | |
| 8 | 13391 | |
| 9 | 17452 |
| Value | Count | Frequency (%) |
| 47 | 14043 | |
| 46 | 15009 | |
| 45 | 17254 | |
| 44 | 16373 | |
| 43 | 19549 | |
| 42 | 22847 | |
| 41 | 19364 | |
| 40 | 13053 | |
| 39 | 20053 | |
| 38 | 16191 |
Episode_Title
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.445811 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 28 |
| median | 52 |
| Q3 | 75 |
| 95-th percentile | 95 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 28.085623 |
|---|---|
| Coefficient of variation (CV) | 0.54592633 |
| Kurtosis | -1.1612279 |
| Mean | 51.445811 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | -0.061889975 |
| Sum | 38584358 |
| Variance | 788.8022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 71 | 10515 | 1.4% |
| 62 | 10373 | 1.4% |
| 31 | 10292 | 1.4% |
| 61 | 9991 | 1.3% |
| 69 | 9864 | 1.3% |
| 23 | 9762 | 1.3% |
| 63 | 9743 | 1.3% |
| 81 | 9741 | 1.3% |
| 64 | 9686 | 1.3% |
| 72 | 9554 | 1.3% |
| Other values (90) | 650479 |
| Value | Count | Frequency (%) |
| 1 | 5922 | |
| 2 | 5134 | |
| 3 | 6943 | |
| 4 | 7000 | |
| 5 | 6366 | |
| 6 | 6993 | |
| 7 | 6369 | |
| 8 | 7690 | |
| 9 | 6751 | |
| 10 | 6454 |
| Value | Count | Frequency (%) |
| 100 | 6348 | |
| 99 | 9270 | |
| 98 | 5902 | |
| 97 | 6521 | |
| 96 | 6720 | |
| 95 | 4838 | |
| 94 | 6763 | |
| 93 | 5919 | |
| 92 | 6533 | |
| 91 | 6975 |
Episode_Length_minutes
Real number (ℝ)
High correlation  Missing 
| Distinct | 12268 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 87093 |
| Missing (%) | 11.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.504738 |
| Minimum | 0 |
|---|---|
| Maximum | 325.24 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12.68 |
| Q1 | 35.73 |
| median | 63.84 |
| Q3 | 94.07 |
| 95-th percentile | 115.29 |
| Maximum | 325.24 |
| Range | 325.24 |
| Interquartile range (IQR) | 58.34 |
Descriptive statistics
| Standard deviation | 32.969603 |
|---|---|
| Coefficient of variation (CV) | 0.51111909 |
| Kurtosis | -1.2030327 |
| Mean | 64.504738 |
| Median Absolute Deviation (MAD) | 29.16 |
| Skewness | -0.0020056126 |
| Sum | 42760643 |
| Variance | 1086.9947 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.6 | 925 | 0.1% |
| 34.4 | 617 | 0.1% |
| 30.69 | 576 | 0.1% |
| 31.68 | 533 | 0.1% |
| 31.46 | 491 | 0.1% |
| 47.02 | 461 | 0.1% |
| 29.61 | 448 | 0.1% |
| 106.52 | 426 | 0.1% |
| 111.68 | 420 | 0.1% |
| 114.98 | 411 | 0.1% |
| Other values (12258) | 657599 | |
| (Missing) | 87093 | 11.6% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1.24 | 1 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.84 | 1 | < 0.1% |
| 2.47 | 4 | < 0.1% |
| 2.97 | 1 | < 0.1% |
| 5 | 38 | |
| 5.0000636 | 1 | < 0.1% |
| 5.00006409 | 6 | < 0.1% |
| 5.00006607 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 325.24 | 1 | < 0.1% |
| 120.93 | 1 | < 0.1% |
| 120.73 | 1 | < 0.1% |
| 120.64 | 2 | < 0.1% |
| 120.37 | 2 | < 0.1% |
| 120.32 | 1 | < 0.1% |
| 120.06 | 1 | < 0.1% |
| 119.99 | 7 | < 0.1% |
| 119.98 | 55 | |
| 119.97 | 44 |
Host_Popularity_percentage
Real number (ℝ)
| Distinct | 8038 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.859901 |
| Minimum | 1.3 |
|---|---|
| Maximum | 119.46 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 1.3 |
|---|---|
| 5-th percentile | 24.79 |
| Q1 | 39.41 |
| median | 60.05 |
| Q3 | 79.53 |
| 95-th percentile | 95.77 |
| Maximum | 119.46 |
| Range | 118.16 |
| Interquartile range (IQR) | 40.12 |
Descriptive statistics
| Standard deviation | 22.873098 |
|---|---|
| Coefficient of variation (CV) | 0.38211052 |
| Kurtosis | -1.2067021 |
| Mean | 59.859901 |
| Median Absolute Deviation (MAD) | 20.04 |
| Skewness | 0.0049262753 |
| Sum | 44894926 |
| Variance | 523.17859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.68 | 560 | 0.1% |
| 26.72 | 523 | 0.1% |
| 56.29 | 490 | 0.1% |
| 30.14 | 445 | 0.1% |
| 31.57 | 439 | 0.1% |
| 58.71 | 431 | 0.1% |
| 80.43 | 428 | 0.1% |
| 67.54 | 411 | 0.1% |
| 36.79 | 410 | 0.1% |
| 67.19 | 401 | 0.1% |
| Other values (8028) | 745462 |
| Value | Count | Frequency (%) |
| 1.3 | 1 | < 0.1% |
| 1.47 | 1 | < 0.1% |
| 1.73 | 1 | < 0.1% |
| 1.77 | 2 | < 0.1% |
| 1.89 | 2 | < 0.1% |
| 2.95 | 2 | < 0.1% |
| 20 | 18 | < 0.1% |
| 20.01 | 69 | |
| 20.02 | 42 | |
| 20.03 | 62 |
| Value | Count | Frequency (%) |
| 119.46 | 1 | < 0.1% |
| 118.93 | 1 | < 0.1% |
| 118.73 | 1 | < 0.1% |
| 118.69 | 1 | < 0.1% |
| 117.76 | 2 | < 0.1% |
| 117.14 | 5 | |
| 115.18 | 1 | < 0.1% |
| 114.97 | 1 | < 0.1% |
| 114.73 | 1 | < 0.1% |
| 112.44 | 1 | < 0.1% |
Publication_Day
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.962776 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 108237 |
| Zeros (%) | 14.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.9973986 |
|---|---|
| Coefficient of variation (CV) | 0.67416456 |
| Kurtosis | -1.2339942 |
| Mean | 2.962776 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.035920341 |
| Sum | 2222082 |
| Variance | 3.989601 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 115946 | |
| 1 | 111963 | |
| 0 | 108237 | |
| 6 | 107886 | |
| 4 | 104360 | |
| 2 | 103505 | |
| 5 | 98103 |
| Value | Count | Frequency (%) |
| 0 | 108237 | |
| 1 | 111963 | |
| 2 | 103505 | |
| 3 | 115946 | |
| 4 | 104360 | |
| 5 | 98103 | |
| 6 | 107886 |
| Value | Count | Frequency (%) |
| 6 | 107886 | |
| 5 | 98103 | |
| 4 | 104360 | |
| 3 | 115946 | |
| 2 | 103505 | |
| 1 | 111963 | |
| 0 | 108237 |
Publication_Time
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 3 | |
|---|---|
| 1 | |
| 0 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 750000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 750000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 750000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 196849 | |
| 1 | 195778 | |
| 0 | 179460 | |
| 2 | 177913 |
Guest_Popularity_percentage
Real number (ℝ)
Missing 
| Distinct | 10019 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 146030 |
| Missing (%) | 19.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.236449 |
| Minimum | 0 |
|---|---|
| Maximum | 119.91 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.79 |
| Q1 | 28.38 |
| median | 53.58 |
| Q3 | 76.6 |
| 95-th percentile | 95.1 |
| Maximum | 119.91 |
| Range | 119.91 |
| Interquartile range (IQR) | 48.22 |
Descriptive statistics
| Standard deviation | 28.451241 |
|---|---|
| Coefficient of variation (CV) | 0.54466263 |
| Kurtosis | -1.1501171 |
| Mean | 52.236449 |
| Median Absolute Deviation (MAD) | 24.23 |
| Skewness | -0.10703539 |
| Sum | 31549248 |
| Variance | 809.47314 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 68.53 | 378 | 0.1% |
| 29.7 | 339 | < 0.1% |
| 42.69 | 332 | < 0.1% |
| 54.59 | 300 | < 0.1% |
| 41.29 | 298 | < 0.1% |
| 71.4 | 296 | < 0.1% |
| 84.57 | 285 | < 0.1% |
| 65.16 | 284 | < 0.1% |
| 70.99 | 283 | < 0.1% |
| 69.72 | 281 | < 0.1% |
| Other values (10009) | 600894 | |
| (Missing) | 146030 | 19.5% |
| Value | Count | Frequency (%) |
| 0 | 3 | < 0.1% |
| 0.01 | 47 | |
| 0.02 | 13 | < 0.1% |
| 0.03 | 27 | < 0.1% |
| 0.04 | 88 | |
| 0.05 | 12 | < 0.1% |
| 0.06 | 82 | |
| 0.07 | 86 | |
| 0.08 | 16 | < 0.1% |
| 0.09 | 51 |
| Value | Count | Frequency (%) |
| 119.91 | 1 | |
| 115.62 | 2 | |
| 115.43 | 1 | |
| 115.41 | 1 | |
| 114.88 | 1 | |
| 114.72 | 2 | |
| 110.14 | 1 | |
| 107.81 | 2 | |
| 107.58 | 1 | |
| 107.34 | 1 |
Number_of_Ads
Real number (ℝ)
Zeros 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3488549 |
| Minimum | 0 |
|---|---|
| Maximum | 103.91 |
| Zeros | 217592 |
| Zeros (%) | 29.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 103.91 |
| Range | 103.91 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.1511304 |
|---|---|
| Coefficient of variation (CV) | 0.85341306 |
| Kurtosis | 505.89391 |
| Mean | 1.3488549 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.0329918 |
| Sum | 1011639.8 |
| Variance | 1.3251012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 217592 | |
| 1 | 214069 | |
| 3 | 160173 | |
| 2 | 158156 | |
| 103.25 | 2 | < 0.1% |
| 53.37 | 1 | < 0.1% |
| 103.91 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 53.42 | 1 | < 0.1% |
| 103.75 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 217592 | |
| 1 | 214069 | |
| 2 | 158156 | |
| 3 | 160173 | |
| 12 | 1 | < 0.1% |
| 53.37 | 1 | < 0.1% |
| 53.42 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 103.25 | 2 | < 0.1% |
| 103.75 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 103.91 | 1 | < 0.1% |
| 103.88 | 1 | < 0.1% |
| 103.75 | 1 | < 0.1% |
| 103.25 | 2 | < 0.1% |
| 103 | 1 | < 0.1% |
| 53.42 | 1 | < 0.1% |
| 53.37 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 3 | 160173 | |
| 2 | 158156 |
Episode_Sentiment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.0 MiB |
| 0 | |
|---|---|
| -1 | |
| 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.333488 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | -1 |
| 3rd row | -1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 251291 | |
| -1 | 250116 | |
| 1 | 248593 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 498709 | |
| 0 | 251291 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 498709 | |
| 0 | 251291 | |
| - | 250116 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000116 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 498709 | |
| 0 | 251291 | |
| - | 250116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000116 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 498709 | |
| 0 | 251291 | |
| - | 250116 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000116 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 498709 | |
| 0 | 251291 | |
| - | 250116 |
Listening_Time_minutes
Real number (ℝ)
High correlation  Zeros 
| Distinct | 42807 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.437406 |
| Minimum | 0 |
|---|---|
| Maximum | 119.97 |
| Zeros | 8551 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6.07879 |
| Q1 | 23.17835 |
| median | 43.37946 |
| Q3 | 64.81158 |
| 95-th percentile | 93.67793 |
| Maximum | 119.97 |
| Range | 119.97 |
| Interquartile range (IQR) | 41.63323 |
Descriptive statistics
| Standard deviation | 27.138306 |
|---|---|
| Coefficient of variation (CV) | 0.59726794 |
| Kurtosis | -0.66123629 |
| Mean | 45.437406 |
| Median Absolute Deviation (MAD) | 20.75602 |
| Skewness | 0.35081226 |
| Sum | 34078055 |
| Variance | 736.48764 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8551 | 1.1% |
| 5.82 | 124 | < 0.1% |
| 10.55 | 108 | < 0.1% |
| 8.75 | 108 | < 0.1% |
| 19.71 | 98 | < 0.1% |
| 6.16 | 98 | < 0.1% |
| 7.92 | 97 | < 0.1% |
| 14.93 | 97 | < 0.1% |
| 11.91 | 93 | < 0.1% |
| 12.78 | 92 | < 0.1% |
| Other values (42797) | 740534 |
| Value | Count | Frequency (%) |
| 0 | 8551 | |
| 0.00056 | 7 | < 0.1% |
| 0.00175 | 8 | < 0.1% |
| 0.00661 | 18 | < 0.1% |
| 0.0105 | 7 | < 0.1% |
| 0.01077 | 24 | < 0.1% |
| 0.01257 | 30 | < 0.1% |
| 0.0296 | 15 | < 0.1% |
| 0.03228 | 16 | < 0.1% |
| 0.0343 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 119.97 | 22 | |
| 119.9 | 16 | |
| 119.8 | 18 | |
| 119.79 | 14 | |
| 119.78 | 17 | |
| 119.74 | 15 | |
| 119.73 | 14 | |
| 119.67 | 12 | |
| 119.66 | 22 | |
| 119.56 | 17 |
Genre_Business
Boolean
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 669479 | |
| True | 80521 | 10.7% |
Genre_Comedy
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 668547 | |
| True | 81453 | 10.9% |
Genre_Education
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True | 49100 |
| Value | Count | Frequency (%) |
| False | 700900 | |
| True | 49100 | 6.5% |
Genre_Health
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 678584 | |
| True | 71416 | 9.5% |
Genre_Lifestyle
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 667539 | |
| True | 82461 | 11.0% |
Genre_Music
Boolean
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True | 62743 |
| Value | Count | Frequency (%) |
| False | 687257 | |
| True | 62743 | 8.4% |
Genre_News
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True | 63385 |
| Value | Count | Frequency (%) |
| False | 686615 | |
| True | 63385 | 8.5% |
Genre_Sports
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 662394 | |
| True | 87606 | 11.7% |
Genre_Technology
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 663744 | |
| True | 86256 | 11.5% |
Genre_True Crime
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 732.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 664941 | |
| True | 85059 | 11.3% |
Interactions
Correlations
| Episode_Length_minutes | Episode_Sentiment | Episode_Title | Genre_Business | Genre_Comedy | Genre_Education | Genre_Health | Genre_Lifestyle | Genre_Music | Genre_News | Genre_Sports | Genre_Technology | Genre_True Crime | Guest_Popularity_percentage | Host_Popularity_percentage | Listening_Time_minutes | Number_of_Ads | Podcast_Name | Publication_Day | Publication_Time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Episode_Length_minutes | 1.000 | 0.022 | -0.020 | 0.006 | 0.014 | 0.006 | 0.008 | 0.005 | 0.014 | 0.005 | 0.014 | 0.012 | 0.002 | -0.009 | 0.024 | 0.932 | -0.058 | 0.006 | 0.007 | 0.013 |
| Episode_Sentiment | 0.022 | 1.000 | 0.010 | 0.002 | 0.003 | 0.004 | 0.004 | 0.008 | 0.003 | 0.010 | 0.001 | 0.008 | 0.009 | 0.013 | 0.014 | 0.037 | 0.002 | 0.012 | 0.009 | 0.011 |
| Episode_Title | -0.020 | 0.010 | 1.000 | 0.010 | 0.022 | 0.015 | 0.011 | 0.011 | 0.015 | 0.017 | 0.019 | 0.018 | 0.017 | 0.042 | 0.019 | -0.022 | 0.006 | 0.006 | 0.003 | 0.005 |
| Genre_Business | 0.006 | 0.002 | 0.010 | 1.000 | 0.121 | 0.092 | 0.112 | 0.122 | 0.105 | 0.105 | 0.126 | 0.125 | 0.124 | 0.016 | 0.014 | 0.010 | 0.000 | 0.569 | 0.012 | 0.006 |
| Genre_Comedy | 0.014 | 0.003 | 0.022 | 0.121 | 1.000 | 0.092 | 0.113 | 0.123 | 0.105 | 0.106 | 0.127 | 0.126 | 0.125 | 0.020 | 0.011 | 0.016 | 0.001 | 0.468 | 0.012 | 0.006 |
| Genre_Education | 0.006 | 0.004 | 0.015 | 0.092 | 0.092 | 1.000 | 0.086 | 0.093 | 0.080 | 0.080 | 0.096 | 0.095 | 0.095 | 0.007 | 0.011 | 0.018 | 0.000 | 0.300 | 0.010 | 0.007 |
| Genre_Health | 0.008 | 0.004 | 0.011 | 0.112 | 0.113 | 0.086 | 1.000 | 0.114 | 0.098 | 0.099 | 0.118 | 0.117 | 0.116 | 0.012 | 0.014 | 0.015 | 0.001 | 0.426 | 0.011 | 0.004 |
| Genre_Lifestyle | 0.005 | 0.008 | 0.011 | 0.122 | 0.123 | 0.093 | 0.114 | 1.000 | 0.106 | 0.107 | 0.128 | 0.127 | 0.126 | 0.009 | 0.014 | 0.006 | 0.000 | 0.453 | 0.011 | 0.003 |
| Genre_Music | 0.014 | 0.003 | 0.015 | 0.105 | 0.105 | 0.080 | 0.098 | 0.106 | 1.000 | 0.092 | 0.110 | 0.109 | 0.108 | 0.005 | 0.009 | 0.014 | 0.004 | 0.510 | 0.004 | 0.004 |
| Genre_News | 0.005 | 0.010 | 0.017 | 0.105 | 0.106 | 0.080 | 0.099 | 0.107 | 0.092 | 1.000 | 0.110 | 0.110 | 0.109 | 0.013 | 0.010 | 0.013 | 0.000 | 0.413 | 0.006 | 0.012 |
| Genre_Sports | 0.014 | 0.001 | 0.019 | 0.126 | 0.127 | 0.096 | 0.118 | 0.128 | 0.110 | 0.110 | 1.000 | 0.131 | 0.130 | 0.019 | 0.018 | 0.025 | 0.000 | 0.462 | 0.010 | 0.003 |
| Genre_Technology | 0.012 | 0.008 | 0.018 | 0.125 | 0.126 | 0.095 | 0.117 | 0.127 | 0.109 | 0.110 | 0.131 | 1.000 | 0.129 | 0.014 | 0.015 | 0.021 | 0.000 | 0.349 | 0.016 | 0.003 |
| Genre_True Crime | 0.002 | 0.009 | 0.017 | 0.124 | 0.125 | 0.095 | 0.116 | 0.126 | 0.108 | 0.109 | 0.130 | 0.129 | 1.000 | 0.012 | 0.012 | 0.014 | 0.003 | 0.654 | 0.014 | 0.008 |
| Guest_Popularity_percentage | -0.009 | 0.013 | 0.042 | 0.016 | 0.020 | 0.007 | 0.012 | 0.009 | 0.005 | 0.013 | 0.019 | 0.014 | 0.012 | 1.000 | 0.023 | -0.014 | 0.009 | -0.005 | -0.000 | 0.011 |
| Host_Popularity_percentage | 0.024 | 0.014 | 0.019 | 0.014 | 0.011 | 0.011 | 0.014 | 0.014 | 0.009 | 0.010 | 0.018 | 0.015 | 0.012 | 0.023 | 1.000 | 0.045 | -0.017 | -0.002 | -0.004 | 0.010 |
| Listening_Time_minutes | 0.932 | 0.037 | -0.022 | 0.010 | 0.016 | 0.018 | 0.015 | 0.006 | 0.014 | 0.013 | 0.025 | 0.021 | 0.014 | -0.014 | 0.045 | 1.000 | -0.115 | 0.004 | 0.005 | 0.026 |
| Number_of_Ads | -0.058 | 0.002 | 0.006 | 0.000 | 0.001 | 0.000 | 0.001 | 0.000 | 0.004 | 0.000 | 0.000 | 0.000 | 0.003 | 0.009 | -0.017 | -0.115 | 1.000 | 0.009 | 0.005 | 0.001 |
| Podcast_Name | 0.006 | 0.012 | 0.006 | 0.569 | 0.468 | 0.300 | 0.426 | 0.453 | 0.510 | 0.413 | 0.462 | 0.349 | 0.654 | -0.005 | -0.002 | 0.004 | 0.009 | 1.000 | 0.003 | 0.010 |
| Publication_Day | 0.007 | 0.009 | 0.003 | 0.012 | 0.012 | 0.010 | 0.011 | 0.011 | 0.004 | 0.006 | 0.010 | 0.016 | 0.014 | -0.000 | -0.004 | 0.005 | 0.005 | 0.003 | 1.000 | 0.009 |
| Publication_Time | 0.013 | 0.011 | 0.005 | 0.006 | 0.006 | 0.007 | 0.004 | 0.003 | 0.004 | 0.012 | 0.003 | 0.003 | 0.008 | 0.011 | 0.010 | 0.026 | 0.001 | 0.010 | 0.009 | 1.000 |
Missing values
Sample
| Podcast_Name | Episode_Title | Episode_Length_minutes | Host_Popularity_percentage | Publication_Day | Publication_Time | Guest_Popularity_percentage | Number_of_Ads | Episode_Sentiment | Listening_Time_minutes | Genre_Business | Genre_Comedy | Genre_Education | Genre_Health | Genre_Lifestyle | Genre_Music | Genre_News | Genre_Sports | Genre_Technology | Genre_True Crime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 34 | 98.0 | NaN | 74.81 | 4 | 3 | NaN | 0.0 | 1 | 31.41998 | False | False | False | False | False | False | False | False | False | True |
| 1 | 24 | 26.0 | 119.80 | 66.95 | 2 | 0 | 75.95 | 2.0 | -1 | 88.01241 | False | True | False | False | False | False | False | False | False | False |
| 2 | 40 | 16.0 | 73.90 | 69.97 | 5 | 1 | 8.97 | 0.0 | -1 | 44.92531 | False | False | True | False | False | False | False | False | False | False |
| 3 | 10 | 45.0 | 67.17 | 57.22 | 1 | 2 | 78.70 | 2.0 | 1 | 46.27824 | False | False | False | False | False | False | False | False | True | False |
| 4 | 31 | 86.0 | 110.51 | 80.07 | 1 | 0 | 58.68 | 3.0 | 0 | 75.61031 | False | False | False | True | False | False | False | False | False | False |
| 5 | 14 | 19.0 | 26.54 | 48.96 | 2 | 0 | NaN | 3.0 | 1 | 22.77047 | False | False | False | True | False | False | False | False | False | False |
| 6 | 6 | 47.0 | 69.83 | 35.82 | 3 | 3 | 39.02 | 0.0 | 0 | 64.75024 | False | False | False | False | False | False | False | False | False | True |
| 7 | 35 | 44.0 | 48.52 | 44.99 | 4 | 3 | 20.12 | 0.0 | 1 | 22.37517 | False | False | False | False | False | False | True | False | False | False |
| 8 | 8 | 32.0 | 105.87 | 69.81 | 1 | 1 | NaN | 2.0 | 0 | 68.00124 | False | False | False | False | False | False | True | False | False | False |
| 9 | 33 | 81.0 | NaN | 82.18 | 4 | 3 | 59.72 | 3.0 | 0 | 45.94761 | False | False | False | False | False | True | False | False | False | False |
| Podcast_Name | Episode_Title | Episode_Length_minutes | Host_Popularity_percentage | Publication_Day | Publication_Time | Guest_Popularity_percentage | Number_of_Ads | Episode_Sentiment | Listening_Time_minutes | Genre_Business | Genre_Comedy | Genre_Education | Genre_Health | Genre_Lifestyle | Genre_Music | Genre_News | Genre_Sports | Genre_Technology | Genre_True Crime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 749990 | 13 | 61.0 | 114.72 | 83.62 | 3 | 2 | 91.80 | 0.0 | 0 | 61.16847 | True | False | False | False | False | False | False | False | False | False |
| 749991 | 3 | 5.0 | 62.46 | 30.03 | 5 | 0 | NaN | 0.0 | 1 | 53.32434 | True | False | False | False | False | False | False | False | False | False |
| 749992 | 12 | 75.0 | 48.67 | 88.62 | 6 | 1 | 25.65 | 3.0 | 1 | 42.08465 | False | False | False | False | True | False | False | False | False | False |
| 749993 | 41 | 83.0 | 23.52 | 38.14 | 5 | 1 | 86.17 | 0.0 | 0 | 19.71374 | False | False | False | False | True | False | False | False | False | False |
| 749994 | 25 | 67.0 | 8.93 | 85.52 | 2 | 1 | NaN | 1.0 | 0 | 7.39878 | False | True | False | False | False | False | False | False | False | False |
| 749995 | 26 | 25.0 | 75.66 | 69.36 | 2 | 2 | NaN | 0.0 | -1 | 56.87058 | False | False | True | False | False | False | False | False | False | False |
| 749996 | 2 | 21.0 | 75.75 | 35.21 | 2 | 3 | NaN | 2.0 | 0 | 45.46242 | True | False | False | False | False | False | False | False | False | False |
| 749997 | 28 | 51.0 | 30.98 | 78.58 | 4 | 2 | 84.89 | 0.0 | -1 | 15.26000 | False | False | False | False | True | False | False | False | False | False |
| 749998 | 41 | 47.0 | 108.98 | 45.39 | 4 | 2 | 93.27 | 0.0 | -1 | 100.72939 | False | False | False | False | True | False | False | False | False | False |
| 749999 | 38 | 99.0 | 24.10 | 22.45 | 2 | 3 | 36.72 | 0.0 | 0 | 11.94439 | False | False | False | False | False | False | False | True | False | False |